Identification of Amino Acid Propensities That Are Strong Determinants of Linear B-cell Epitope Using Neural Networks

نویسندگان

  • Chun-Hung Su
  • Nikhil R. Pal
  • Ken-Li Lin
  • I-Fang Chung
چکیده

BACKGROUND Identification of amino acid propensities that are strong determinants of linear B-cell epitope is very important to enrich our knowledge about epitopes. This can also help to obtain better epitope prediction. Typical linear B-cell epitope prediction methods combine various propensities in different ways to improve prediction accuracies. However, fewer but better features may yield better prediction. Moreover, for a propensity, when the sequence length is k, there will be k values, which should be treated as a single unit for feature selection and hence usual feature selection method will not work. Here we use a novel Group Feature Selecting Multilayered Perceptron, GFSMLP, which treats a group of related information as a single entity and selects useful propensities related to linear B-cell epitopes, and uses them to predict epitopes. METHODOLOGY/ PRINCIPAL FINDINGS We use eight widely known propensities and four data sets. We use GFSMLP to rank propensities by the frequency with which they are selected. We find that Chou's beta-turn and Ponnuswamy's polarity are better features for prediction of linear B-cell epitope. We examine the individual and combined discriminating power of the selected propensities and analyze the correlation between paired propensities. Our results show that the selected propensities are indeed good features, which also cooperate with other propensities to enhance the discriminating power for predicting epitopes. We find that individually polarity is not the best predictor, but it collaborates with others to yield good prediction. Usual feature selection methods cannot provide such information. CONCLUSIONS/ SIGNIFICANCE Our results confirm the effectiveness of active (group) feature selection by GFSMLP over the traditional passive approaches of evaluating various combinations of propensities. The GFSMLP-based feature selection can be extended to more than 500 remaining propensities to enhance our biological knowledge about epitopes and to obtain better prediction. A graphical-user-interface version of GFSMLP is available at: http://bio.classcloud.org/GFSMLP/.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of B-cell Linear Epitopes with a Combination of Support Vector Machine Classification and Amino Acid Propensity Identification

Epitopes are antigenic determinants that are useful because they induce B-cell antibody production and stimulate T-cell activation. Bioinformatics can enable rapid, efficient prediction of potential epitopes. Here, we designed a novel B-cell linear epitope prediction system called LEPS, Linear Epitope Prediction by Propensities and Support Vector Machine, that combined physico-chemical propensi...

متن کامل

Machine learning approaches for prediction of linear B-cell epitopes on proteins.

Identification and characterization of antigenic determinants on proteins has received considerable attention utilizing both, experimental as well as computational methods. For computational routines mostly structural as well as physicochemical parameters have been utilized for predicting the antigenic propensity of protein sites. However, the performance of computational routines has been low ...

متن کامل

B and T-Cell Epitope Prediction of the OMP25 Antigen for Developing Brucella melitensis Vaccines for Sheep

Brucellosis, produced by Brucella species, is a disease that causes severe economic losses for livestock farms worldwide Due to serious economic and medical consequences of this disease, many efforts have been made to prevent the infection through the use of recombinant vaccines based on Brucella outer membrane protein (OMP) antigens. In the present study, a wide range of on-line prediction sof...

متن کامل

Compatibility of B-Sheets with Epitopes Predicted by Immunoinformatic in Human IgG

Background & Aims: Antibodies, well-known as immunoglobulins (Igs), are produced by B lymphocytes and specifically defend against pathogens. Igs are glycoproteins and have high diagnostic value in several diseases including infections (1). Igs are composed of light and heavy chains (2, 3). Each chain is comprised of about 110-120 amino acid residues which create immunoglobulin folds named domai...

متن کامل

Engineering Application Of Correlation on Ann Estimated Mass

A functional relationship between two variables, applied mass to a weighing platform and estimated mass using Multi-Layer Perceptron Artificial Neural Networks is approximated by a linear function. Linear relationships and correlation rates are obtained which quantitatively verify that the Artificial Neural Network model is functioning satisfactorily. Estimated mass is achieved through recallin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2012